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Abstract 

In the reordering buffer management problem (RBM) a sequence of n colored items enters a buffer with 
limited capacity k. When the buffer is full, one item is removed to the output sequence, making room for 
the next input item. This step is repeated until the input sequence is exhausted and the buffer is empty. The 
objective is to find a sequence of removals that minimizes the total number of color changes in the output 
sequence. The problem formalizes numerous applications in computer and production systems, and is known 
to be NP-hard. 

We give the first constant factor approximation guarantee for RBM. Our algorithm is based on an intricate 
"rounding" of the solution to an LP relaxation for RBM, so it also establishes a constant upper bound on 
the integrality gap of this relaxation. Our results improve upon the best previous bound of 0{\/\og k) of 
Adamaszek et al. (STOC 201 1) that used different methods and gave an online algorithm. Our constant factor 
approximation beats the super-constant lower bounds on the competitive ratio given by Adamaszek et al. This 
is the first demonstration of an offline algorithm for RBM that is provably better than any online algorithm. 



^Computer Science Department, Technion — Israel Institute of Technology, Haifa 32000, Israel. Email: 
noaelgScs. technion . ac . il. 

^The Rachel and Selim Benin School of Computer Science and Engineering and the Center of Excellence on Algorithms, The Hebrew 
University of lerusalem, lerusalem 91904, Israel. Email: yrabanigcs . hu ji . ac . il. Research supported by ISF grants 1109-07 
and 856-1 1 and by BSF grant 2008059. 



1 Introduction 



Problem statement and motivation. In the reordering buffer management problem (RBM) a sequence of n 
items of colors c(l), c(2), . . . , c(n) (taken from a finite set of colors C) enters a buffer with capacity k G N. 
When the buffer is full, one item is removed, making room for the next input item. This step is repeated until the 
input sequence is exhausted and the buffer is empty. Thus, the buffer can be used to permute the input sequence 
in a limited way. In the permuted output sequence, we are interested in the number of times there is a color 
change between adjacent positions. Out of all feasible solutions, the objective is to find a sequence of removals 
that batches items of the same color and minimizes the total number of color changes in the output sequence. 

Introduced in [ 19], this elegant model formalizes a wide scope of resource management problems in produc- 
tion engineering, logistics, computer systems, network optimization, and information retrieval (see, e.g., lfT9l l8l 
[171 El). For example, one of the motivating examples of lPT9l is batching cars by color in the paint shop of a 
car manufacturing plant to minimize the consumption of paint solvent used to wash spray guns each time the 
paint color is changed between two consecutive cars. Naturally, the buffer capacity is limited by physical space 
constraints, and the incoming stream of cars is dictated by the schedule of the assembly line. (This particular ap- 
plication is part of the ROADEF Challenge 2005 of the French Operations Research Society [10], see also [14].) 
Generally, in computer systems and production systems buffers are often prepended to subsystems to facilitate 
better control of their input (see ll20l[T8l ). so understanding how to optimize buffer utilization is a fundamental 
and important problem. 

Our results. We give the first constant factor approximation guarantee for RBM, improving on the best pre- 
vious bound of 0(\/log k) [2]. Our algorithm is based on "rounding" the solution to a linear programming 
relaxation that we recently proposed [6[. Thus, our work also establishes an 0(1) upper bound on the integrality 
gap of this relaxation, improving upon the best previous bound of 0(yJ\og k) (which is not explicit in J2j, but 
can be derived from their work). Most previous work on RBM (including the above-mentioned J6l|2l) discussed 
online algorithms. There are recent lower bounds on the competitive ratio of fl( -y/log kj log log k) for deter- 
ministic algorithms, and f2(log log k) for randomized algorithms (against the oblivious adversary) [2]. Thus, our 
(deterministic) algorithm shows, for the first time, that an efficient offline RBM algorithm can beat any online al- 
gorithm. We note that in some applications, e.g. the paint shop sequencing problem mentioned above, the setting 
enables an offline computing of a good solution. Moreover, proving strong upper bounds on the integrality gap 
of a natural linear programming relaxation seems to be one of the major stumbling blocks on the way to design 
randomized online algorithms that beat the deterministic lower bound. 

Our algorithm for transforming a fractional solution into an integer one, without increasing the cost by more 
than a constant factor, works in phases. A phase starts at the time reached by the previous phase. A phase has 
a time horizon target, which is the time that the fractional solution increases its cost by some small constant 
factor. The goal of a phase is to reach the target by evicting a constant number of colors. This goal might be 
impossible to achieve. In such a case, we use an intricate charging scheme that chooses colors to evict and charges 
their eviction to the past fractional cost of other colors. The main difficulty in the analysis is to prove that the 
conditions under which the simple strategy fails to reach the target imply that the charging scheme can be used 
to bridge the gap. Our proofs involve illuminating observations on the structure of fractional RBM solutions. 

Previous work. As mentioned above, RBM was introduced in Ifl9l , who gave an 0(log 2 /c)-competitive online 
algorithm for the problem. The guarantee was improved through a sequence of papers lPT2l 0, culminat- 
ing in the 0{^/\og k) bound of [2]. This was also the best known approximation guarantee for RBM prior to 
our work. While the algorithms evolved gradually, each paper uses completely different tools of analysis. The 
0(log kj log log A;) -competitive analysis of applies a dual fitting argument, using the same relaxation that we 
use in this paper. The later and better result of [2] does not use linear programming. However, their proof can be 
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modified to show that the 0(\/\og k) bound on the competitive ratio also holds when competing against a frac- 
tional adversary, and therefore the integrality gap of the [6] relaxation is O(y/\og k). As mentioned above, in 
lower bounds on the competitive ratio of f2(-y/log kj log log k) and f2(loglog k), respectively, were established 
for deterministic and randomized online algorithms, respectively. 

Beyond the implications of the online setting, not much was known about the offline case prior to our work. 
Recent work shows that the problem is NP-hard (UHl. Allowing resource augmentation, [9] give, for every 
e > 0, an 0(l/e)-approximation algorithm for RBM with the caveat that the approximation algorithm is allowed 
to use a buffer of size (2 + e) • k. This strengthens a similar result implicit in lfl2l . proving this for their online 
algorithm, but only for e = 2. The paper [ 12] also shows that the optimum for a buffer of size k can be at most a 
factor of 0(log k) larger than the optimum for a buffer of size 4k. In ifTTl. a matching lower bound of f2(log k) was 
established, so the above resource augmentation arguments cannot yield constant factor approximation guarantees 
for RBM. 

There are simple constant factor approximation algorithms for the complement objective of maximizing 
the number of adjacent pairs with no color change in the output sequence lfT6l |7| (the constants are 20 and 9, 
respectively). The minimization version that we consider here seems more adequate for the applications in mind, 
and it also seems more challenging. Clearly, if we expect successful batching into relatively long monochromatic 
sequences, then guarantees on the complement maximization objective do not guarantee good performance in 
terms of the minimization objective. 

For some applications, it is suitable to use more general cost functions to measure the cost of color changes 
in the output sequence. In particular, non-uniform costs, where the cost of switching to a color depends on the 
color, were discussed in lfT2~l l6l l2l . Metric costs, where the cost of switching between colors is determined by a 
metric on the colors, were discussed for the line metric in |[T5l[T3l and for general metrics in ifTTTl . None of these 
models are known to have constant factor polynomial time approximation algorithms. 



2 The Algorithm 

Consider an instance T of RBM that is given by the buffer size k and by a sequence of n items of colors 
c(l), c(2), . . . , c(n). Let C denote the set of colors that appear in the sequence. Our algorithm solves a lin- 
ear programming relaxation for X, and then uses the fractional optimal solution to derive an integer solution 
whose cost is at most a constant factor greater than the fractional solution we started with. We use a time indexed 
relaxation that was first defined in our previous work (SJ , where it was used in a dual fitting analysis of an online 
algorithm for the problem. For completeness, we define the relaxation and motivate it here. 

We use 0-1 variables Xij, for i = 1, 2, . . . , n and j = max{A; + 1, i}, . . . , k + n. An assignment Xij = 1 
indicates that the i-th input item is removed from the buffer at time j. The reordering buffer management problem 
can be expressed as an integer linear program IP on these variables. We require the following notation. For every 
input item i, let last(i) denote the last input item of color c(i), and for i / last(i) let n(i) denote the next input 
item of color c{i). For notational convenience, we put n(i) = k + n + 2 for alH = last(i). Then, IP is: 



minimize Ya=iY1 



n(i)-2 



j=max{i,k+l} 
k+N 

y~] = i vi (i) 

j'=max{i,fc+l} 
3 

^2 Xi :j = 1 Vj (2) 

x n(i),j ~ Xij-i > Vi ^ last(i); V? > n(i) (3) 
€{0,1} Vi;Vj>i. (4) 



s.t. 
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The constraints ([T]) guarantee that each item is eventually removed from the buffer. The constraints ([2]) guarantee 
that at each time slot one item is removed from the buffer. The constraints ([3]) prevent the solution from switching 
colors while there are still items of the current color in the buffer. These constraints are needed to guarantee that 
the linear objective function measures the cost of the solution correctly. Notice that the objective function simply 
counts the number of items that are removed from the buffer before the next item of the same color is encountered 
in the input. Without constraints Q, we could avoid paying for color changes by keeping in the buffer just the 
last encountered item of a color c until the next item of this color is reached. We denote the optimal value of IP 
by zip. 

Proposition 2.1. The value of an optimal solution for X is exactly z\p. 

Proof Sketch. The obvious correspondence between RBM output sequences and feasible IP solutions matches 
RBM solutions and IP solutions with the same cost. ■ 

A linear programming relaxation LP is derived by relaxing the constraints (|4]) to 

> Vi; Vj > i. (5) 

We denote the value of the relaxation at a feasible solution x by z(x), and the optimal value by zlp- Given 
a feasible fractional solution x of LP (i.e., x satisfying constraints ([T]), ([2]), ([3]), and ([5])) and a time step j, we 
can think of x as defining a fractional packing of input items into the buffer at time j. I.e., every input item 
i < j is in the buffer with weight w\ = w(x)j where w\ = 1 — Yjj'=i x i,j'- F° r notational convenience, we 
define w\ '~ = 1. Notice that w is a function of x; we usually omit x from the notation. Also notice that due to 
constraints ([I]), w\ G [0,1]. 

Proposition 2.2. If j < n, then Yli<j w l = k , an d if 3 > n tnen J2i<j w \ = k + n — j. 

Proof Sketch. By constraints ([2]), for every k + 1 < j < k + n, it holds that Yli<j x i,j = 1- Therefore, if j < n, 

i<j i<j \ j'=i J j'=k+li<j' 



A similar argument shows the case of j > n. 

" " — " J ' ■ " ... - <i 

nyl) — n(i) 



Proposition 2.3. For every i ^ last{i),for every j > n(i) — 1, wj < w^^y 

Proof Sketch. If at some point j > n{i) — 1 we have vA > up rv then Yli>>i x i,j' = w { > ^ c\ — w ^ + ^ 



J2j'>j+i x n(i),j'> so at some point j' > j + 1, x n ^)j> — acij'-i < 0, violating constraints Q. 

Our main result is the following theorem. 

Theorem 2.4. There is a constant a > 1 and a (deterministic) polynomial time algorithm which given a feasible 
fractional solution x of LP computes a feasible 0-1 solution x of IP such that z(x) < a ■ z(x). 

Corollary 2.5. There is a (deterministic) polynomial time a-approximation algorithm for reordering buffer man- 
agement. 



Proof. Compute an optimal solution x* of LP with cost z(x*) = zlp < -^IP- Use Theorem 2.4 to compute a 0-1 



solution x* with cost z(x*) < az^ < az\Y>. The corollary follows from Proposition 2.1 
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We now describe the rounding algorithm of Theorem 2.4 The algorithm works in phases. Each phase evicts 
one or more colors from the buffer. To evict a color, the algorithm removes the items of this color from the 
buffer until the buffer contains no such item. We refer to the eviction of one color from the buffer as a step. The 
algorithm uses constants 61,62, 63 G (0, 1), and 7 = 7(^1, 62) > 1, to be defined later. In order to describe the 
algorithm, we need the following definition: 

Definition 2.6. For q = 1, 2, . . . , [z(x)/6s\, 

t q = min \t: ^ ^ Vi,j > q ■ 63 

j=k+l i<j 

where yij = xij if n(i) > j + 1, and y^j = otherwise. 

In other words, t q denotes the earliest time at which the cost of the fractional solution x increases to at least 
q • 63. The goal of phase q is to reach t q . (If the last t q was already reached, the goal of the last phase is to reach 
the end of the output sequence.) Each phase includes three types of steps. There are steps that are charged against 
the past x-cost of the items removed by that step. There are steps that are paid for by charging the past x-cost of 
other items in the buffer. Finally, there are a constant number of steps that cannot be charged to the past x-cost, 
so they are charged to the increase in x-cost that sets t q . 

For the second type of steps, we will use a charging scheme to determine the choice of colors to evict, and 
also to analyze the algorithm. For each item i in the buffer at time j we maintain an index Tjj which is the last 
time before j that i was charged. Initially, = i — 1. At time j, if i was not charged at time j — 1, we put 

= Otherwise, we put ry = j — 1. (The charge is implied by the sequence of charging times.) For 

each time step j, for each item i in the buffer of the algorithm at time j, define d\ = w] 1 ' 3 — w\. This is the 
fraction of i that the solution x removed from the buffer since the last time i was charged. 

We call the set of all items of a specific color in the algorithm's buffer a block. Let B J ' = {B{ , B\ , . . . , B ] m . } 
denote the set of blocks in the algorithm's buffer at time j (before removing from the buffer an item at time j). 
For r = 1, 2, . . . , rrij, let fi denote the earliest item in B 3 T . For an item i, we denote t(i) = mm{t : w\ < 1 — 61}. 
We assume that the blocks are ordered so that t(f{ ) < i^) < ■ • • < t(fmj)- We denote by Aj the difference in 
volume between the algorithm's buffer and the fractional buffer at time j. Formally, 



rrij 

A i = i|K-^n 1 =x;^(i-^)= ^ 
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r ~ l i£B J r i<j: -0)1=0 

where w = w(x). For a current time step j and a target time step t q , let 

P q = argm a x{|/| : I C [j, t q ] A Vi, i' G I,c{i) = c(i')}, 

and put 

t\ = max{j,t 9 + 1 - \P q \}. 

This is the earliest time where items from one color that arrive after time j — 1 can be removed from the buffer 
consecutively, reaching time t q or later. (Notice that if t q < j, then t\ = j.) Intuitively, if we reach t q without 
using items from I q , then in one more step we can reach our target t q . Consider a decision time j (the previous 
phase ended at time j — 1). We execute the following procedure: 

0. While our buffer contains an item i such that t(i) < j, we evict color c(i), and we increment j to be the 
first time step following the one we've reached. When there are no more steps of this case, we execute the 
first case among [TW] that applies. 
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1. If there is a color c that we can evict and reach t q , we evict one such color, thus ending the phase. 



2. If our buffer contains tq — j items from one or two colors, we evict th ose c olors in two steps, and if we 
haven't reached t q , we also evict the color of I q . We will prove in Claim 3.2 that we reach t q , so the phase 
ends. 

3. If Aj > i ■ (tq — j), then we do the following. For B G B 3 , put d J B = A • X^eB d{- Define s\ > S2 > 
• • • > s Pj inductively as follows. Initially set si = rrij. Assuming s p is denned, let r p < s p be the largest 

index for which Yju=t Y^ iG B j d i - fa\ B s p \- If X«=r ^ij ^ <*i» let r ' P e [ r P> s p] be tne largest index for 

which XX=r' ^ j — (See Figure 1 in the appendix.) We evict the color of block B J Sp , and charge the 

P 

items in B 3 r , , . . . , (i.e., for each charged item i, set Tjj+i = j). If r£ > 1, set s p+ \ to be — 1, else 

setpj = p. Otherwise, if Xu=r $ i < 5\ and r p > 1, set s p +\ = argmax{ | B 3 U \ : u G [r p — 1, s p — 1]}. 
Otherwise, if r p = 1, set pj = p. If the entire process removes fewer than tq — j items that were in our 



buf fer a t time j, we evict the color of the largest block B G B' J that remains. We will prove in Claims 3.3 
that at this point we can evict the color of I q and reach t q , thus ending the phase. 



and 



3.4 



4. In the remaining case, we evict the color of the largest block B G B- K We increment j to be the time step 
following the last output step. Now, we execute the procedure again. We will prove in Claim 3.5 that in a 
phase we never reach case|4]twice. 

This completes the definition of x. 



3 Analysis 

In this section we prove our main result, Theorem 2.4 We first give an interpretation of the feasible fractional 



solution x. Consider a color c, a sequence / of color c items i\,i2, ■ • • > im 

and a starting time j. Let Mjj denote 
the matching given by Mij(i s ) = j + s, for all s = 1, 2, . . . , m. We say that Mjj is a monochromatic sequence 
matching (MSM) iff the items are a maximal sequence of consecutive items of the same color c(i±). In other 
words, Mi j is an MSM iff it satisfies the following conditions: (i) for every s = 1, 2, . . . , m — 1 it holds that 

c(i s ) = c(i s+ i) and n(i s ) = i s +\', (ii) j + s > i s for every s = 1, 2, . . . , m; (Hi) j + m < n(i m ) — 1. 

Proposition 3.1. For every feasible fractional solution x there is a fractional packing of monochromatic sequence 
matchings A = X(x) that satisfies the following constraints: (a) for each input item i, j-iei = 1> (b)for 
each time slott, Y^jj:te\j+lJ+\I\] X IJ = 1; ( c ) = 52ij 

Proof Sketch. We can construct A by the following algorithm. While there exist such that > 0, 

find such a pair with minimum j. Find a maximal sequence / = (ii, i%, . . . , i m ) of items of color c(i±) with 
Xi st j+ S > 0. By constraints ((3) (which are maintained through the induction), it must be that Xi s j+ S > x^j+i 
and j + m < n(i m ) — 1. Put A/j = a^j+i and subtract Xjj from Xi s j +S for all s = 1, 2, . . . , m. (Notice that 
this will not cause constraints Q to be violated.) Constraints (a), (b) follow from constraints ([T]), ([2]) of the LP. 
Equation (c) follows from the fact that every MSM Mjj that we construct ends at time j + |/| which precedes 
the arrival of the next item of this color. ■ 

We next prove that the algorithm is well-defined. 

Claim 3.2. Executing case^ends a phase. 

Proof. Consider a phase q where we execute case [2] at time j. Let B,B' G B J denote the two blocks with 
\B\ + \B'\ >t? q — j, and let c, c' denote their colors. (If the t? q — j items stipulated by casejijare of a single color, 
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set B' = and d = c.) Recall that P q is the set of items that determine t{. Notice that l{ fl(BUB') = 0, because 
all the items in B U B' arrived before time j, and all the items in P q arrive at time j or later. Let b denote the 
number of items in P q that are removed when we evict the colors c, c'. Let /' denote the set of remaining items 
from P q . Notice that b > only if the color of P q is either cor c'. When we evict the colors of B, B', we reach 
tg + b — 1. As the items in /' can be removed starting from time tq + b and ending at time t q , evicting the color 
of /' ends the phase. ■ 

Claim 3.3. For every 81,82 > such that 82 > 28± there exists 7 = 7(^1, 82) such that applying the process in 
case^starting at time j removes at least t\ — j items that were in the buffer at time j. 

Proof. Let A^ = ^2 BeB j SieB ^\ ^ e tne uncharged portion of items that are removed by x, but the algorithm 
holds at time j. We start by showing that Aj > 1 ~ j;^ 2 • Aj. Each block B that the algorithm removed from the 

buffer before time j contributes J2ieB w l to A? ( and tne sum °f an those contributions is exactly Aj). Whenever 
the algorithm removes a block B, it charges some of the volume of the items that remain in its buffer and paid for 
removing B. The total volume charged is at most 82 • \B\. (This is trivially true when B is not removed during 
a case [3] process and does not charge anything.) If all the items that are charged when B is removed are not in 
the algorithm's buffer at time j, the same contribution of J2 i& B w l contributes to Aj as well. Now consider the 
case that the buffer does contain items that were charged for the removal of B. Let j' be the beginning of the 
case [3] process in which block B = B 3 S was removed, and let B 3 P be one of the blocks that were charged for 
removing B and its items are still in the algorithm's buffer at time j. By the ordering of the blocks in B 3 , it 

must be that t(f£ ) < t(f£ ). As we can't apply case O w 3 ., > 1 — 8\. Therefore, the first item fi of block B 

U % 

also has wP ., > 1 — Si. As every item in block B must have at least the same weight as the first item, we get 

Si 

that Yl,i^B w l — (1 — ^1) ' 1^1- Block B only charges at most a volume of 82 • \B\ of items that are still in the 
algorithm's buffer at time j. Putting Yli&B w l = (1 — ^) ' l^l> we § et tnat & contributes to Aj^ at least 

(i-e)-\B\-s 2 -\B\ > i-e-s 2 -(Si-e) _ 1-81- 8 2 



(i-0)-|£| - \-e-(Si-e) i-8! 

of the portion it contributes to Aj. 

Going back to the main argument, Let s e m > s e M.) > • • • > s e rn denote the indices of the blocks that 
we removed during the case [5] process ({s e (i), . . . , s e ^} C {si, s%, . . . , s Pj }). For each p G {I, ...,£}, by 

the definition of r' , s, V Se(p) 1 y~\ „, c£ < 82 ■ \Bi . A, and V Se(p) $ , > 5\. Consider the indices 
s e(p)+ij s e(p)+2> • • • j s e(p+i)-i- (Those are the indices defined in the process of blocks that weren't removed 
between removing block Bi,. and block B J Se(p+1) .) We now show that for each u G [e(p) + 1, e(p + 1) — 1], 
\ B L\ < f 1 • \ B L+i\ ( the same holds for u G [1, e(l) - 1] and u G [e(£) + l,pj - 1]): 



8%i 1 Su 1 Su 1 

/ Stt-1 \ 

= \B{ I • Id? 1 , + V d j , < 2«Ji • IBI ^ I. 
\ " g=r u / 

The first inequality follows from the definition of r u . The second inequality follows as B J Su is not removed by the 
process, therefore, B J Su+1 is defined to be the maximal block in {B 3 . v B 3 U , . . . , B 3 S The last inequality 
follows as Y^=l u ^ ^i- Furthermore, as the algorithm did not continue to execute casejojat time j, 
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d\ < 1 — w\ < 5i for every i G B 3 r _ 1 , and therefore the average over i G B 3 _ x of d| is also less than 5i. Thus 



^ e(p+l)-u 

in [s e ( p+1 ) + 1, s e ( p )+i] (blocks that weren't removed and weren't charged for removing any block) as follows: 



(or \e(p+i)—u ■ 
-j 1 j • \B J Se{p+1) I . We can now bound the contribution to A J of the blocks with indices 



s e (p)+l e(p+l)-l Su e(p+l)-l 

E E^ = E E E^ E (fc+*H*l 

9=s e(p+ i)+l i( z B J u=e(p)+l g=s u +i+l i( zBi u=e(p)+l 



s 

e(p+l)-l , . e(p+l)-u 



< + E (f) 



u=e(p)+l 

< (<y 2 + *i)-|i& 



1 ^(p+l) 1 

2tfi 25i(5 2 + 5i) 



e( P +l)l 5 2 _ 2<5 a 52-2*1 Vp+I)" 

Adding the contributions of block B 3 S , +1 . and the blocks that were charged for its removal, we get: 

S e(p) + 1 



= 5 2 (5 2 + 5i) , 
<5 2 -2<5i y '^^(p+d 1 52-25i 1 Mp+d 1 



For the same reason, 



and 



Therefore, if pj > e(£) then 



E E^¥#-K 



«5 2 _25i 1 Mi) 1 

e (i) 



e(f) + l 



^ 2^ a i S <5 2 _25i 1 
•'' 'V, 1 /\ />•:, 



s e(l) + l l-\ s e(p) + l «i=m(j) 

= EE<= E E^ + E E E^+ E E^ 

ff- 1 9= r Pj= l ieBi P- 1 9=r' e(p+1) ieB J g 9=r' e(1) i & B 3 g 



< 



5 2 (5 2 + 5i) 
5 2 - 25i 



l<l + EK( P )l)- < 6 > 



Thus, 



i Z^' «eo.)i - «5 2 (5 2 + 5i) J ~-5 2 (5 2 + -5i) l-5 : J 



P =i 



<5 2 - 25i _ 1 - 5i - 62 _ 1 
5 2 (5 2 + 5i) 1 — 5i 7 



> x/x ■ xV , x T-(^-i)- W 



Choosing 7 = 4^ • we get that \Bi p . \ + £* =1 \B 3 e(p) | is at least fj - j. 

To conclude, by the definition of the process we removed from the buffer at least ^p=i I^Lm I i tems from 
the items that were in the buffer at time j. Notice that if pj = e{€) we get that, similar to Equation ([6]), A? < 
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S2 S2-2sP ' 2p=l \ B L( P )l and for tne same reason as in Equation ([7]>, Y%=i \ B L( P ) I > 4 _ 3- Otherwise, it must 
be that the last block B J Sp . considered in the process was not removed. Thus, removing the largest block B G B 3 
that remained will remove at least \B 3 Sp . | items, and overall at least \B J Sp , | + Ylp=i \Bs e{p) | > 4 ~~ 3 items. ■ 

Claim 3.4. Executing case^ends a phase. 

Proof. Consider phase q where we execute case[3]at time j. Let j' be the time we e xecu te the last step of case|3] 



and let I be the set of items from I q that were removed before time j'. By Claim 3.3 at least t\ — j removed 



items were in the buffer at time j, therefore at least t\ — j + \I\ items were removed overall. Thus, f >t q + 
We can now evict the color of I q and reach t q , as there are at least \I° q \ — = t q — t q + 1 — |/| > t q — j' + 1 
items of this color that can be removed from the buffer consecutively starting at time j'. ■ 

Let cj) = « 1.618 denote the golden ratio. 

Claim 3.5. Assuming that 7 > jz^, if we 've reached case^^ then in the repeated execution of the procedure we 
execute one of the cases\l^\ 



Proof. Suppose that case|4]is executed at time j in phase q. We may assume that t q > j, otherwise the claim is 
vacuous. Let 

£ = {(/, J") : f <3 A / + \I\>t q A A/j/ > 0} 

denote the set of monochromatic sequences in the packing A that are matched to an interval containing the entire 
interval [j, t q \. (See Figure[2]in the appendix.) Notice that by the definition of t q and the fact that t q -\ < j < t q , 

A = A/j'>l-(53. (8) 

By the definition of t q , for every (I, j') G C, none of the items in Mj~j,(\j, tq — 1]) (the items matched by Mjjr 

to the interval [j, t\ — 1]) arrive at time j or later. As we've reached caseQ we may conclude that max{|i3| : B G 
B 3 } < t 3 q — j (otherwise casejijwould apply) and Aj < A ■ (t 3 q — j) (otherwise casej^Jwould apply). In particular, 

consider (I, j') G C. Let tjj> denote the minimum time if for which Mj~j,(t') is in the algorithm's buffer. If no 

such time exists, set trj' = t q . The items in Mfj,([j, tiji — 1]) are no longer in the algorithm's buffer at time j. 

Therefore, Y^,^ f)eC ~~ ■?') — Ar Using Equation ([8]), we conclude that E[tjj' — j] < ^j,^ (t q — j), 

where the expectation is taken over (I, j') G C with probability distribution Pr[(I, j')] = (See FigureJ^jin 
the appendix.) In particular, 

mm{t I}jl - j : (J,/) G £} < ]_ (t{ - j), (9) 



and by Markov's inequality 
Pr 

Let 



£1 -0 = V = IT0' <10) 



(-^min, Jmin) = axgmin{ijj/ - j : (I, f) G £}. 
Let B G B J denote the block containing items from I mm in the algorithm's buffer. Notice that by Equation 1 

i*i>(i-^H-* 
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As we've reached case|4](and therefore case[2]does not apply), for all other blocks B' G B^ , 

where the last inequality uses 7 > > j±^- Thus, the case step at j must evict the color of B. As case|lj 
did not apply, evicting the color of B does not reach t q . Denote 



£'={(I,i0 6£:^-i<^- y (^-i)} 



For every (I, j') G £' it must hold that the portion of I in the algorithm's buffer is B. This is because for all 
other colors ti d > - j > ( 1 - ^nhg^ ) (fq~j), and as 7 > we get t I>f - j > ( A ) (4~j)- So consider 



7(i-<5 3 ) ) V"<? j;,""""- f / i-5 3 ' «-5^"J,r ,/ ^ 7(i-<5 3 )' 
the situation after the step at j, where we remove B and possibly additional items of B's color, and we reach 
t' < t q . Consider the set A of the last i? q — j —\B\ items that the algorithm removed so far. For all i G A and 



for all (/,/) G M/ f(i) > if. Therefore, by Equation {Kj]>, wf +1 > ^(1 - <5 3 ) > i. On the other hand, 



w\ +1 = 0. Let t" be the point matched by M j min j> to the first item i' of I mm that wasn't yet encountered. 
Notice that for all (I, f) G M I)f (i') > t". Clearly, t" > Therefore, 

> E > 1 • (*" - *' " 1) > 1 • « +1 -*'-!)• 

When we execute the procedure again, we first execute case|0] Assuming that we haven't reached t q , the following 
holds. Each removed item moves our current position t' by 1, and we may lose ^ in our estimate of A t / + i for 
each increment of t'. Each removed item with the color of B moves the target t" by 1, but in those steps we do 
not lose - in our estimate of A f / +1 . (Notice that if we do not reach t q , then we haven't yet encountered this item 
in any of the sequences in £' and the reason for its removal must be other sequences.) Thus, with respect to the 
new t', we still have that A t / +1 > ^ • (t q +1 — if — 1), so if cases jlj and |2j do not apply, then case ijapplies. ■ 



We now analyze the charging scheme that is used in case|3| We say that a block B 3 U G that is charged at 
time j pays d J j (towards evicting the color of B J Sp for which r ' < u < s p ). Denote by W the set of blocks that 
are charged at time j. 

Lemma 3.6. J2j Y, B e& ^ < 2 ■ z(x). 

Proof. Fix j and consider a block B G B 3 which pays d? B at time j. Notice that 

1 

d B = rgT • Yl 4^ max K : * e B}. 

' ' ieB 

Let ib = argmaxj^ : ieB}. Then, d\ B is simply the sum of Xjji over (/, j') such that Mjji (is) G (rj B j,j']. 

So we can think of such (/, j') as contributing Xjj> towards the payment of cf B . If j' + |/| < j, then (J, f) will 
never be "asked" to contribute again. However, if j' + |7| > j, then I contains items that are not charged at time 
j, and such items may appear in a future block B' that pays for removing some block in a future phase. We argue 
that it must be the case that / + |/| < t q . To prove that, assume for contradiction that this is not the case. Notice 
that at time j the item is is still in the algorithm's buffer, and Mjji(is) < j. So if the algorithm evicts c(ib), 
it must reach t q . This contradicts the assumption that at time j the algorithm executes case[3j as case [TJ applies. 
Therefore, if (I, j') contributes again in a future phase at some time j", we have that j' + 1 J| < t q < j", so (I, j') 
will never contribute a third time. ■ 
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We are now ready to prove our main result. 

Proof of Theorem\2A\ We choose 8 1 ,8 2 ,8 3 € (0, 1) such that 8 2 > 25i and 7 = 1 '/-V/ 2 • .WfeSi) > 
(For example, we can choose 8% = ^j, 82 = jq, and 83 = |.) 

The number of phases is at most \z(x)/8s]. In a phase, cases [l]44] are executed at most once. The total 
number of steps due to case[TJ case|2j case|4| and the last two steps of case [3] is at most 4. (The worst case is 
when the algorithm executes casejijand then case|2 ) Therefore, the total cost of those steps is at most j^-z(x)+4. 

Now consider case[0] When a color c is evicted at time j, this is because there's a block B in the algorithm's 
buffer with i G B such that wj < I — 8\. In particular, vP t < 1 — <5i. We remove f B at time j, so for every 

monochromatic sequence (I, j') with Mjj/(f B ) < j, when this step is over we've reached j' + |/|. Those are 
the sequences that pay for the drop of at least 81 by time j in the weight of f B . As we've reached past them, we 
will never count them again for another step of caseKn Thus, the total number of such steps is at most j- ■ z(x). 

The remaining steps are color evictions due to the charging scheme of case [3] Notice that whenever a block 
B Sp G £> J is removed by this case, we have that Y^u=r' $ i — ^i- Therefore, the number of such steps is 

at most • J2j J2 p <=i^ S«=r' • w here P e denotes the set of indices of blocks whose removal created a 

charge at time j. By Lemma 



3.6 



■ Ej Epeii T,u= r > p d Bi ^ Tt ■ z ( x )- The total cost of a11 the cases 

is z{x) < ^ + jjj-j • z{x) + 4. As z(x) > \C\ > 1, we get that the approximation guarantee a satisfies 

a < f + f + 4. ■ 

— 01 63 



4 Concluding Remarks 

Our methods can be adapted easily to handle some additional constraints, such as incurring a color change cost 
whenever we've accummulated too many time steps without a color change (a constraint relevant to [ 10]). Numer- 
ical estimates of the best constant that the above analysis gives indicate that it is below 135, taking 8% w 0.02763, 
82 ~ 0.11416, and £3 w 0.18481. We did not attempt to optimize our analysis, however, it is unlikely that our 
methods can be pushed to yield a very small constant (such as 2). Substantially improving the approximation 
guarantee for RBM is an interesting open problem. Also, adapting our methods to deal with more general cost 
measures appears to be a non- trivial task. In a variant of RBM called the (uniform) /c-client problem flU, instead 
of a buffer there are k input sequences. At each time step, the next item from one of the sequences is chosen 
and moved to the output sequence. (So the choice of which item to remove affects the order of the combined 
input sequence.) The goal is the same as RBM: to minimize the number of color changes in the output sequence. 
Adapting our methods to deal with this setting seems to be another fascinating problem. 
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Appendix: Figures 
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Figure 1: The black area indicates the df-s, the dark grey area indicates the remaining portion of the Xij-s that 
was previously charged, and the light grey area indicates the w\ -s. 
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Figure 2: The strips of varying tones indicate the packing of MSMs in the fractional solution, and the outlined 
rectangles indicate the tij'-s. 
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